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-2-11 5 10 

MHMKNI KKNQVM N L 6 P N S 

20 30 
KLLKEYKSOLIELNIEOFEA 

10 50 
GiGLiLGDAYiRSRDEGKTY 

60 70 
CMQFEWKNKAYMDHVCLLYC 

80 90 
OWYLSPPHKKERYNiiLGNLY 

. 100 110 

ITW6AQTFKH0AFNKLANLF 
, , 120 130 

IVNNKKIIPNNLVENYLTPM 

1^*0 150 
GLAYWPMDDGGKWDYNKNSi 

160 170 
NKSIVLNTQSFTFEEVEYLV 

180 190 
K6LRNKFQLNCYVKINKNKP 

200 210 
IIYIDSMSYLIFYNLIKPYL 

220 230 
IPQMMYKLPNTISSETFLK* 

Positions that can be chapged without affecting enzvme activity (demonstra ted) 
Dosltlons -1 and -2 are not natural. ihe two amino acids are added due to 

cloning strategies 

positions 1 to 10: can be deleted 
position 36: 6 Is tolerated 
position ^40: R or V are tolerated 
position k\\ S or N are tolerated 
position ^3: 7^ Is tolerated 
position 46: V or N are tolerated 
position 91: 5 Is tolerated 
positions 123 and 156: £ are tolerated 
position 223: A and S are tolerated 

Changes that affect enzvme activity (demonstrated) 

position 19: L to S 

position 38: T to 5 or N 

position 39: G to D or E 

position 40: C to Q 

position 42: C to R 

position 44: D to E £ or H 

position 45: A to E or D 

position 46: Y to D 

position 47: T to K or N 

position 80: r to S 

position 1^*4: D to E ^ \ CI ^ 

position 1«<5: B to E r I O, O 

position 145: G to E 
position 147: 5 to S 
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Group I Intron Encoded Endonucleases and Related Endonucleoses 



ENDONUCLEASE 



RECOGNITION SEQUENCE 



CLEAVAGE 



SITE 



V INTRON SfTE 



in 



o 

CL. 
GO 



o 



CD 



1 



I-Sce I 
(Sacchoromyces mitochondrio) 

I-Sce IV 
(Saccharomyces mUochondria) 

I-Sce It 
(Saccharomyces mitochondria) 

I-Ceu I 
(Chlomydomonas chloroplasl) 

I-Ppo I 
(Physarum nucleus) 

I-Sce rn 

(Saccharomyces mitochondria) 

I-Cre 1 
(Chlomydomonas chloroplasl) 



CGCITAGGGAjSj&lcAGGGTAATIA TAGC 
GCGAT CCCITAT TGI CCCAT TAT ATCG 



TTCTCAT GAuSIgCT CTAAT CC AT GG 
AAGAGT AC|TAATCGAGATTAGG TACC 

CfTTTGGT'^:AX£LdAGAAGTAjTAT AT T T 
G IaA ACCAG|TAGGT CTT CATl ATA TAAA 

TAACGGT^CJAaIgGT AGCGAAA TTCA 
ATT GCCAGIGAT TCCAT CGCTTT AAGT 

TGIACTCT CTJAjSJGGT AGCC^AA TGCC 
A c |t G AG a G[A at T C C A T CGGf T T T ACGG 



GGAGGTTTTGGTAACT ATTTAT TACC 
CCT CCAAAACCATT GATAAATA AT GG 

V 

GGG TT CAAAACGT CGTGAGACAGTT T 
CCCAAGT TTTGCAGCACTCTGT CAAA 



Endo. See I(RF3) 
(Saccharomyces mitochondria) 
(Non inlronic) 

HO 

(Sacchoromyces nucleus) 
(Non intronic) 



GAT GCT GTA£iLdAT AGGCTTGG TT AT 
CTA CGACAlT CCGT AT CCGAACC AAT A 



ClTT TCCGCAAXLaIgT 



GAA AGGCGTTGTCAT ATTAAAA 



AT AATTTT at AA 
TATT 



I-Csm I 
(Chlamvdomonos mitochondria) 
(Pufolive endonuclease) 

I-Pan I 
(Podosporo mitochondria) 
(Putative endonuclease) 



V 

ACCAT GGGGTCAAAT GTCTTT CTGGG 
TGGTACCCCAGTT T A CAGAAAGACCC 

V 

GTGCCT GAATGAT AT TTATTACCTTT 
CACGGACTT ACTAT A AAT AAT GGAAA 



to 



(Bacleriophoge T4) 



I Tev I 



I Tev n 



CA 
GT 



GCTCAGTAGATGTTTTCTTGGGTC TACCGT TT AAT 
CGAGT CAT CTA CAAA AGAA CCCAGATGGCAAATTA 



I Tev in 



CAAGCTTATGAGTATGAAGT G A ACAC GJJT A T T 
GT TC GAAT ACT CATACT T C A C T T GT GteA AT A A 

I V 
GCTATTCGTTT TlLAT GTATCTTTTGCGTGTAGCTTTAA 
CGATAAGCAAA A ATlACAT AGAAAACGCACATCGAAAT T 



FIG. 6 



J 



EXPRESSION VECTORS 

EcoRI EcoRI 




FIG. 7 




Pst I 2848 



FIG. 8 



.9 
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Drd I 2721 




FIG. 9 
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Sou3A I 
Mbo I 
Dpn n 

ScrF I 

Nci I 

Msp I 

Hpa n 

Dso V 

BstK I 
Xmo I 
Sma I 
ScrF I 
Nci I 
Dso V 

Rsa I Dpn I 
NIo IV NIq IV 
Sac I Csp6 I Alw I 

HgiA I BstK I Taq I 

Ed 136 I BsaJ I Sal I 

Sph I Rma I Kprt I BstY I Hinc U 
NspC I Bspl286 I Ben I Sfe I 
EcoR I XbQ I Ban I BamH I Acc I I-Sce I 

Apo I NIo m Bon II Ben I Sfc I Rma I 

Toq I Nsp7524 I Atu I Ave I Pst I Hgo I 

Alu I Nsp I Bfo I Asp718 Alw I BspW I Bfa I y EcoR V 

I t t II II II 11 II II I I II 1 I I site X :>ce , 

CCMGCTCGAAnCGCATGCTCTAGAGCTCGGTACCCGGGATCCTGCAGTCGACGCffAGGGATMCAGGGTMm 2320 
GGnCGAGCnAAGCGTACGAGATCTCGAGCCATGGGCCCTAGGACGTCAGCTGCC tMCCCTATTGTCCCATTA rGTCTA 

111- II -n II -II II M 1 I II I IT : : I • 

2244 2255 2262 2271 2279 2286 2296 2318 

2247 2255 2266 2275 2284 2292 

2249 2255 2265 2275 2284 2296 

2249 2261 2271 2279 2289 2297 

2255 2265 2276 2284 

2255 2262 2271 2279 2289 
2265 2275 2289 

2265 2275 2290 

2265 2272 2280 
2271 2279 
2272 2280 
2275 
2275 
2275 
2275 
2275 
2275 
2276 
2276 
2276 
2276 
2276 
2276 

2280 
2280 
2280 



FIG. lOA 
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Serf I 
Hex I 
Msp I 

Hpo n 

Dso V 

BstK I 

Ben I 
Xmo I 
Smo I 
Serf I 
Nci I 
Dsa V 
BslK I 
BsaJ I 
Sau3A I 

Mbo I Soc I 

Dpn n HgiA I 

Alu I Sfe I Dpn I Rso I 

Hind m Sfe I Pie I NIo IV Csp6 I 
Dde I Pst 1 Hinf I BstY I NIo IV EcoR I 

Sou3a I Ssc8337 I BomH I Kpn I Apo I 

Mbo I NIo m Taq I Alw I Ban I Alu I BccF I 

Opn II Sph I Sail Rmo I Ben I Eel 136 I Hoe m 

Dpn I NspC I Hinc II Mnl I Ave I 8sp1286 I Gdi H 

BslY I Nsp7524 I Acc I Bfa I BsaJ I Bon H Eoe I 

Bal II N3P I BspM I Xbo I Alw I Asp718 Toq I Bsr I 

IIIM II lit Ml M 111 MIm'^ II II III! 
ATCAGATCTAAGCnGCATGCCTGCAGGTCGACTCTAGAGGATCCCCGGGTACCGAGCTCGAAnC^^ 2400 

TAGTCTAGAnCGAACGTACGGACGTCCAGCTGAGATCTCCTAGGGGCCCATGGCTCGAGCnAAGTGACCGGCAGC^ 
II III H 'til M -I II III Mill II I 1 I I 

2324 2336 2344 2354 2361 2369 2379 2387 

2324 2336 2348 2355 2364 2375 2389 

2325 2336 2348 2358 2365 2375 2389 

2325 2336 2348 2355 2365 2375 2390 

2325 2337 2349 2360 2369 2376 2391 

2325 2341 2360 2369 2381 

2328 2342 2351 2360 2369 2381 

2330 2342 235 1 2360 2370 

2331 2342 2361 2370 

2361 2375 
2361 2375 
2361 

2365 
2365 
2365 
2365 
2365 
2365 
2365 

2366 

2366 

2366 

2366 

2366 

236C 

Mae n 2366 
ScrF H 
EcoR n 
Dsa 3Z: 

BslN I Fok I Alu I 

Moe m BstK I Fno4H I Pvu H 

Mae n Bsr I BsoJ I Mse I Bbv I NspB 11 

ill III I 11 It 

TACAACGTCGTGACTGGGAAAACCCTGGCGTTACCCAACnAATCGCCTTGCAGCAC^LTCCCCCmCGCCAGCTG 2480 
ATGnGCAGCACTGACCCTTTTGGGACCGCAATGGGnGAAnAGCGGAACGTCGTGTAGGGGGAAAGCGGTCGACCGCA 
III -11 I I •! I • MI 

2405 2413 2423 2440 2451 2471 

2410 2424 2451 2471 

2424 2457 2472 

2424 

itit FIG. lOB 

2430 
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NAMES: pTSm w 
pTKm &J 
pTTc 0) 



Km 
Sm 
Tc 



I-Sce I 
SITE 



mob RP4 




Ap 



R 



Construction: pGP 704 from De Lorenzo, with transposase 
gene and insertion of the linl<er[I— Scel] in NotI unique site 



FIG. I I 
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URA 3 




Construction: pD 123, from J.D. Boeke 
with insertion of a linker[I-SceI-NotI] in BamHI 



FIG. 12 



Sma I 8806 

Nar I 8778 
Kpn I 8810 
Sac I 8743 
EcoR I 8311 
Nde I 8263 



Xba I 9009 



Sac I 7243 



EcoR V 6418 



•Sac I 9033 

SCO I 436 




Sma I 5021 

Nco I 4931 



Kpn I 2205 

Xho I 2220 

Sal I 2226 

HinD m 2241 
EcoR V 2402 
Nho I 2829 

EcoR V 2940 

EcoR V 3015 

Xba I 3095 

Sac I 3212 

Nar I 3247 
Smo I 3275 
Kpn I 3279 



Nar I 4587 
Kpn I 4926 

FIG. I3A 
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A302 A\\\xv4s\x\\\\\\\\\\\\\\\\\\\\\vw 

160 - 500 
T62 n\\n\\\n\\\\\\snIx\\vnx\\\v\>^n\w 



FIG. ISA 



A-^no ■ 560 




D304 T"" Ixsw.^ 

1 40 •™*s\N\\N\\\\N\\V\.\\\.VS.\\\\S\NX\\S\V 



r^.. ^QO . 270 

uo. 230 , 450 

500 



M57 

rao ^20 . 310 




FIG. I5B 



> 



t40 •^N\\\^\N\\\\\\^^,vs^^\\^.\x\N\x\\\^\v 

Axno ^20 . 560 

iic.-7 160 ■ 500 

H81 2^2««MJk\\\\\\\\\\\\\\\\\\s^^ 
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FIG. 150 
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FIG. I5D 



I-Sce I MAP 
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FIG. I5E 
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GENETIC MAP 
PHTSICAL MAP 



I-Scc I 
sites 



E40 




CO 
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O C9 



I 
I 

T62 



I 
I 

G41 



0304 



TR 50 kb 



pUKC 145d 
pUKC 04(>cd 
pEXG 012cd 
pUKG 033cd 
pUKC 077cd 
pEKG lOOcd 



50-70 kb 



pEKG 025 
pBKC 082 
pUKG 129 
pUKG 074 
pEKG 019 
pUKG 150 
pUKG 136 I 



pUKG 134 
pEKG 023 
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pUKG 055 
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REPAIR USING ECTOPIC DUPUCATION 
DELETION AND GENE CONVERSION 



FIG. I9A 




I-Sce I 



OR 



REPAIR USING HOMOLOGOUS SEQUENCE ON PLASMID 
DELETION OF URA3 AND REPLACEMENT 
PLASMIO INTEGRATION 



FIG. I9B 
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